Self-interacting random walks 

Yuval Peres* Serguei Popov''' Perla Sousi"'" 

■(:s4; February 27, 2013 

o 

(N 



in 



Abstract 

Let III, . . . , Hk be d-dimensional probability measures in M'' with mean 0. At each step we 
choose one of the measures based on the history of the process and take a step according to 
that measure. We give conditions for transience of such processes and also construct examples 
of recurrent processes of this type. In particular, in dimension 3 we give the complete picture: 

Ph ' every walk generated by two measures is transient and there exists a recurrent walk generated 

P^ , by three measures. 
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^ 1 Introduction 

0^ 

_j. ■ Let /ii and ^2 be two zero mean measures in M^ with finite supports that span the whole space. 

cn ■ On the first visit to a site the jump of the process has law fii and at further visits it has law fi2- 

^T) [ The following question was posed in [2]: Is the resulting walk transient? 

^\i . More generally, one can consider any adapted rule (i.e., a rule depending on the history of the 

process) for choosing between /ii and fi2, and ask the same question. It turns out that the answer 
to this question is positive, even in M'^, as proved in Theorem 1 1 . 2 1 b elow . Moreover, in 3 dimensions 
this result is sharp, in the sense that one can construct an example of a recurrent walk with three 
measures, as shown in Theorem 11.51 



^ 



This naturally fits into the wider context of random walks that are not Markovian, namely where 
the next step the walk takes also depends on the past. Recently there has been a lot of interest in 
random walks of this kind. A large class of such walks are the so-called vertex (or edge) reinforced 
random walks, where the walker chooses the next vertex to jump to with weight proportional to 
the number of visits to that vertex up to that time; see e.g. [U [U [lOl [TTl [13]. Another class of such 
walks is the so-called excited random walks, when the transition probabilities depend on whether 
it is the first visit to a site or not, see e.g. O [H [HI [HI [TJ]. 

In this paper we study transience and recurrence for walks in dimensions 3 and above that are 
generated by a finite collection of step distributions. We now give the precise definition of the 
walks we will be considering. 
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Definition 1.1. Let /ii,...,/ifc be k probability measures in M and for each j = l,...,k, let 
^^, ^21 • • • be i.i.d. with law fij. Define an adapted rule i = {^{i))i with respect to a filtration (J-i) to 
be a process such that i{i) G {1, . . . ,k} and is J-i measurable for all i. We will say that the walk X, 
with Xq = 0, is generated by the measures fii, . . . , fik and the rule i if 

Xi+i = Xi + Ci+i- 

We say that a measure fi in M^ has mean if f^a xfi{dx) = 0. Also we write that a measure fi 
has /3 moments, if E \\Z\\^ < oo, where Z ~ /i. We define the covariance matrix of /U as follows: 
Cov(^) = inZ.Z,])^.^,. 

Note that if /i is a measure in W^, then it has an invertible covariance matrix if and only if its 
support contains d linearly independent vectors of R . We will call such measures d-dimensional. 

In this paper we are mainly interested in the following two questions: 

• Let /xi, . . . , /Xfc be mean probability measures in W^. What are the conditions on the measures 
so that for every adapted rule i the resulting walk is transient? 

• For a given dimension d, how do we construct examples of recurrent walks generated by k 
d-dimensional mean measures? How small can this number k be made? 

In Section n. II we state our results concerning the first question and in Section [1.21 about the second 
one. Observe that Theorems 11.21 and 11.51 give a complete picture in dimension 3: any two mean 
measures with 2 + /? moments, for some /3 > 0, always generate a transient walk, while there is an 
example of a recurrent walk generated by three 3-dimensional measures of mean with a suitable 
adapted rule. 

1.1 Conditions for transience 

Theorem 1.2. Let f-ii, fi2 b^ d-dimensional measures in W^ , d > 3, with zero mean and 2 + /? 
moments, for some (3 > 0. If X is a random walk generated by these measures and an arbitrary 
adapted rule i, then X is transient. 

The following result will be used in the proof of Theorem 11.21 but is also of independent interest, 
since it gives a sufficient condition on the covariance matrices of the measures used in order to 
generate a transient random walk X for an arbitrary adapted rule i. 

For a matrix A we write A for its transpose, Amax(^) for its maximum eigenvalue and tv^A) for 
its trace. 

Theorem 1.3. Let /ii, . . . ,/ifc be mean measures in W^, d > 3, with 2 + /3 moments, for some 
/? > 0. Suppose that there exists a matrix A such that for all i we have 

triAMiA^) > 2X^^,{AM,A^), (1.1) 

where Mi is the covariance matrix of the measure ^i. If X is a random walk generated by these 
measures and an arbitrary adapted rule i, then X is transient. 

We will refer to (jl.ip as the trace condition. 

It turns out that the local central limit theorem implies the following lower bound on the number 
of measures needed to generate a transient walk. 



Proposition 1.4. Let ^i, . . . , fi^ be mean measures in d> 2k + l with 2 + /3 moments, for some 
/3 > 0. Then the random walk X generated by these measures and an arbitrary adapted rule £ is 
transient. 

We will prove Proposition 11.41 in the beginning of Section [2] and then Theorems 11.31 and 11.21 in 
Sections 12.11 and 12.21 respectively. Then in Proposition 12.61 in Section 12.31 we discuss the case when 
the covariance matrices are jointly diagonalizable. We present a conjectured sufficient condition for 
transience at the end of the paper. 

1.2 Recurrence 

We now define a random walk in d dimensions, which is generated by d measures that are fully 
supported in W^ and we will prove that it is recurrent. 

Let So, ... , Cd-i be the coordinate vectors in Z . We consider a random walk (X„, n = 0,1,2,...) 
on Z , (i > 3, defined in the following way. Fix a parameter 7 > 0, and for x = (xq, • • • , Xd-i) G ^ 
define q{x) = minj/c : \xk\ = maXj=o,...,d-i \xj\}. Then 

where ^„+i = ibeg(x„) with probabilities 2{-i+d-i) ^^"^ ^n+i = ±6^ for k 7^ Q{Xn) with probabilities 
of 4-d-ii • ^"^^ words, we choose the maximal (in absolute value) coordinate of Xn with weight 7 and 
all the other coordinates with weight 1, and then add 1 or —1 to the chosen coordinate with equal 
probabilities. 

Theorem 1.5. For each d > 3 there exists large enough jd such that the random walk X is recurrent 

for all 7 > 7d. 

We will prove Theorem 11.51 in Section [3j The proof of this result relies on the explicit construction 
of a suitable Lyapunov function, but it is rather involved, so in Section [3] we also give simpler 
examples of a finite number of d-dimensional measures and adapted rules that generate a recurrent 
walk in d dimensions. 



2 Proofs of transience 

In this section we give the proofs of the results on transience. We first prove Proposition II. 4^ since 
its proof is short and elementary. 

Proof of Proposition 11.41 In order to prove this proposition, let us first give an equivalent def- 
inition of the random walk that we are considering. 

For each j = 1, . . . ,k, let (^^, (^2) • • • be i.i.d. with law fij. For an adapted rule i we define for all 

je {!,..., k} 



ij,i)= J^l(£(m)=i) 



m=l 



and then writing r^ = r{i{i), i) + 1 we let 



Xi+i — Xi + (. . 



It is easy to see by induction that the process X has the same law as the process of Definition 11.11 
Let R > and for every n we define the event 



Ar. 



k ij 

3 n, . . . , 4 > : ii + . . . + ifc = n and ^ ^ C/ e B(0, R) [ . 

0=1 i=\ 



We now fix a choice of ii, . . . , ifc such that ii + . . . + i^ = n. Then by O Corollary/Theorem 6.2] 
we get for a positive constant c 

since there must exist some ij which is at least njk. It is easy to see that the total number of 
fe-tuples (ii, . . . ,ifc) with ij > for all j and ^- ij = n is equal to (^Z^)- Since ("^J^ ) — cin'^'"-'^, 
for a positive constant ci, we deduce that 



^d/2 ^d/2-k+l ' 

which is summable if d > 2k + 1. Hence, from Borel-Cantelli we obtain that a.s. only finitely many 

of the events An happen. 

Now notice that for every n we have 

{X„Ee(0,i?)}CA„, 

and hence we deduce that a.s. for all sufficiently large n, the random walk at time n will stay 
outside of the ball S(0, R). Since this is true for any i? > 0, we get that if d > 2A; + 1 the random 
walk is transient. D 

2.1 Trace condition and transience 

In this section we give the proof of Theorem 11.31 First we state and prove some preliminary results. 
The following lemma is a standard result, but we state and prove it here for the sake of completeness. 

Lemma 2.1. Let {St) be a random walk generated by k zero mean measures and an arbitrary adapted 
rulei. Let Tt = a{So, . . . ,St) be its natural filtration. Let a, r^ > and define ip{x) = WxW'" ArQ^. 
Lf the process {ip{St)) is a super-martingale, then S is transient, in the sense that a.s. 

\\St\\ — )■ CO as t — )• oo. 

Proof. We first show that a.s. 

limsup \\St\\ = oo. (2.1) 

Indeed, there exist u G S'^^^, e > and /i > such that for all j G {1, . . . ,k} 

F{{Zj,u) >e)>h, 
where Zj ~ ^j. This implies that for all tti, n E N we have 

F{{Sn+m-Sn,u)>em\Tn)>h"'. 



Hence this shows that a.s. hnisup^ KStju)] > em/2 for ah m, and so (j2.ip holds. Clearly, this 
implies that a.s. 

lim inf (^(S't) = 0. (2.2) 

Since {ip{St))t is a positive super-martingale, the a.s. super-martingale convergence theorem gives 
that liuit^oo fiSt) exists a.s. and thus from (j2.2p we deduce that a.s. liuit^oo fiSt) = 0, which 
means that a.s. \\St\\ — >■ oo as i — >• oo. D 

The following lemma shows that if the covariance matrices of the measures used to generate the 
walk X satisfy the trace condition (jl.ip . then there is a function ip such that (p{X) is a super- 
martingale. 

Lemma 2.2. Let (p{x) = ||x|p" A 1, for x € M . Let fii,...,fik be zero mean measures in M 
with 2 + /3 moments, for some /3 > 0, and with covariance matrices Mi, . . . ,Mk satisfying for all 
i = 1, . . . ,k 

tr(Mi) > 2Ai„ax(Mi). 

There exists a > small enough and a constant tq so that if \\x\\ > tq, then for all i = 1, . . . ,k if 

E[ip{x + Z,) - ip{x)] < 0. (2.3) 

Proof. It suffices to prove (j2.3p for a fixed i. Since the covariance matrix Mi is positive definite, 
there is an orthogonal matrix U such that UMiU'^ is diagonal with non- negative eigenvalues. The 
matrix UMiU is the covariance matrix of the random variable UZi. 
Since U is orthogonal, we get that for all x 

(f{U{x + Zi)) = (f{x + Zi) and (p{Ux) = (p{x). (2.4) 

In order to prove the lemma, we will apply Taylor expansion up to second order terms to the 
function if around Ux evaluated at UZi. We will drop the dependence on i from UZi and write 
simply Z and x instead of UZ and Ux in view of (j2.4p to lighten the notation. 

So, let Z have covariance matrix M which is in diagonal form and with diagonal elements Ai, . . . , A^. 
Let Z = Zl{\\Z\\ < ||x||/2). Note that if a.s. \\Z\\ < B for a positive constant B, then Z = Z \i 
\\x\\ > 2B. The calculations below are a bit simpler in this case, since Z would have mean and 
the same covariance matrix as Z. 



If 11x11 > 2, then 



x + Z 



> 1 and so Lp{x + Z) 



x + Z 



. In what follows we abbreviate 



, _ Mx) „ ,. _ gV(^) ,„ , . _ 5V(^) 



dxi ' "-^ dxidxj ' ^^ " dxidxjdxk 

Applying Taylor expansion to (p up to second order terms gives for some rj G (0, 1) 

-I d -. d 

^{x + Z) = ip{x) + {V^{x),Z) + -^ip'l^{x)Z,Zj + - J2 ^'U^ + vZ)ZiZjZk 

i,j=l i,j,k=l 

-, d 1 '^ 

= ^{x) + {Vv{x),Z) + -Y,'PU^)Z^ZJ + ^^ Y, v'lidx + vZ)ZiZ,Z, 

i,j=l i,j,k=l 



x: <-(x)^.z,i (iizii > M^ . 



Claim 2.3. There exist positive constants C, Ci such that for all i,j 

C 



E 



{Vip{x),Z) 



< 



X 



\a+l3+2 



and E[ZiZjl{\\Z\\ > \\x\\/2)] < 



Ci 



Proof. By Holder's inequality we have 



E[||Z||1(||Z|| > ||x||/2)] < 



2/^+iEn|Z||/3+21 K 



1/3+1 



< 



1/9+1 ■ 



Since E[Z] = 0, we have E 



E 



Z - Z 



, and hence 



E 



Z 



|E[Z1(||Z|| > ||x||/2)] II <E[||Z||1(||Z|| > ||x||/2)] < 



K 



1x11/3+1' 



For the first term of the Taylor expansion we have for a positive constant C 



E 



{V^{x),Z)]\ = Y, 






\x 



\a+2 



E 



Zi 



< 



E 






\a+2 



E 



< 



adXllxl 



C 



|a+/3+3 



For all i,j we have by Holder's inequality again 

E[ZiZjl{\\Z\\ > \\x\\/2)] < K\\\Zfl{\\Z\\ > \\x\\/2) 

thus proving the claim. 

We continue proving Lemma |2.2[ For the second order terms we write 



\x 



|a+/3+2- 



< 



Ci 



\x 



1/3' 



E 



Z'iZj 



E[ZiZjl{\\Z\\ < \\x\\/2)] =E[ZiZj] -E[ZiZ,l{\\Z\\ > \\x\\/2)] 
and hence since for i ^ j we have E[ZiZj] = 0, by Claim [23] we get 

d 



E 



ZiZj 



< - — -o and 

Wt.wp 



^^f,(x)E[z2l(||Z||>||x||/2) 



i=l 



< 



Co 



\a+l3+2' 



Since for all i we have E[Z?] = Aj, we obtain 



±^i.^[zn ^ ± A. -""-":;:!; " ^'-- . ^ ff!^(" ^ ^) - Et. A,: 



i=l 



i=l 



i=l 



X 



|Q+4 



D 



(2.5) 



The rest of the second order terms can be bounded as follows: 



^(/9-'j-(x)E 



ZiZj 



E 



a{a + 2)|xj||xj 

\\^\\a+A 



E 



ZiZj 



Q;(a + 2)1x^1 |xj I Ci 



E 



\a+A 



Cs 

||a+/3+2- 



For the remainder in the Taylor expansion we have 



max 



^TA^ + r]Z) 



< 



c 



< 



Ci 



|x + r/Z||"+3 \\x 



|a+3' 



since 



Z 



< \\x\\/2. We want to control E[(/?(x + Z) — f{x)]. We write 

E[ip{x + Z)- ^{x)] = E L(x + Z)- ip{x + Z)l + E L(x + Z)- ^{x) 



(2.6) 



and by Markov's inequality since E 



\Z 



|2+/3 



< OO 



E 



Lp{x + Z) - (p{x + Z) 



<F(\\Z\\ > llxll/2) < 



C, 



1/3+2 ■ 



Since /3 > 0, if we take < a < /3, then we obtain that there exists a constant tq > 1 so that for 

||x|| > Tq 



E 



{Vvix),Z) 



1 

+ 2 






jj^j 



z,-za iizii > 



1 



+ ^ E \E^'U^ + vZ)ZiZ,Z, 



i,j,k=l 



+ 



E 



(^(x + Z) - (f{x + Z) 



< 



- E 9'-,(x)E[Z,Z,] 



ij=^ 



(2.7) 



The assumption on the trace of the matrix M gives that for a small enough (smaller than /?) 
Yl,j=i ^'j > ^i[ot + 2) for all z, and hence using (|2.5|) we get for ||j;|| > tq 



E'^^U^)IE[^f]<0. 
«=1 

This and the inequality (j2.7p finishes the proof. 

We now have all the required ingredients to give the proof of Theorem 11.31 



D 



Proof of Theorem 11.31 Let rg > 1 be the constant of Lemma 12.21 Let ip{x) = \\x\\ "- Ar^^, for 
a > as in Lemma 12.21 Notice that when ||x|| > ro, then (p{x) = (p{x) = ||x||~". We will first show 
that if Yt = AXt, then 

n^t+i) I ^t] < ^{Yt). (2.8) 

Since tq > 1, we have <f{x) < Lp{x) for all x. So we get 

n^{Yt+i) - ^{Yt) I Tt] = n{^{Yt+i) - ^{Yt))±{\\Yt\\ > ro) | Tt] 

+ E[{ifiYt+,)-^{Yt))li\\Yt\\<ro)\Tt] 
< E[{^{Yt+,) - ^{Yt))l{\\Yt\\ > ro) | Tt] , 

since ^{Yt) = r^^" if \\Yt\\ < rg and (p{x) < r^" for all x. Since the covariance matrices of the 
measures used to generate the walk Y satisfy the trace condition (jl.ip . Lemma 12.21 gives that 

EMYt+,) - ^iYt))li\\Yt\\ > ro) | -F*] < 

and this completes the proof of ()2.8p . Therefore by Lemma [2. II we get that a.s. ||AXj|| = ||1^|| — )• oo 
as t — )• oo. Since for all t we have ||AXj|| < ||j4|| llStll and ||j4|| > 0, we deduce that a.s. 



IX, 



which concludes the proof of the theorem. 



oo as t — )■ oo, 



D 



2.2 Two measures in 3 dimensions 

In this section we give the proof of Theorem 11.51 

Proposition 2.4. Let Mi,M2 be 3 x 3 invertible positive definite matrices. Then there exists a 
3x3 matrix A such that 

tr(^Mi^'^) > 2X^^^{AM,A^) V i = 1,2. 

Proof. We prove Proposition 12.41 by constructing the matrix A of Theorem 11.31 directly. 

Let /Ui,/i2 have covariance matrices Mi and M2 respectively and ^i ~ /x^ for i = 1,2. Since Mi is 
positive definite, there exists an orthogonal matrix U such that UMiU^ is diagonal, i.e. 

UMiU^ = 




where a,b,c > are the eigenvalues of Mi. If we now multiply the vector U^i by the matrix D 
given by 

D = 



J_ 








/E 









1 

7b 











1 



then Cov{DUS,i) = I, where / stands for the 3x3 identity matrix. 

So far we have applied the matrix DU to the vector ^1 and we have to apply the same transformation 
to the vector ^2- The vector DU^2 will have covariance matrix M2. Since it is positive definite, it 
can be diagonalised, so there exists an orthogonal matrix V such that 

VM2V'^ = 




where Ai > A2 > A3 > are the eigenvalues in decreasing order. Applying the same transformation 
to DU^i is not going to change its identity covariance matrix, since V is orthogonal. 
The condition we want to satisfy is 

Ai + A2 + A3 > 2A„ 

for all i = 1,2,3. Since the eigenvalues are in decreasing order, it is clear that this inequality is 
always satisfied for i = 2,3. Suppose that A2 + A3 < Ai. Multiplying DU^2 by the matrix 



B 




will give us a random vector with covariance matrix 




which clearly satisfies the trace condition (jl.ip . Multiplying VDUS^i by the same matrix will give 
us a vector with covariance matrix 

4^ 

1 
1 

which satisfies the trace condition (jl.ip . since A2 < Ai. D 

Proof of Theorem II. 2L By projection to the first three coordinates, it is clear that it suffices to 

prove the theorem in 3 dimensions. 

In d = 3, the statement of the theorem follows from Theorem 11.31 and Proposition 12. 4[ D 

Remark 2.5. It can be seen from the proof of Proposition 12.41 that if the measures ni and /_f2 are 
supported on any 3 dimensional subspaces of M , then a walk X generated by these measures and 
an arbitrary adapted rule is transient. 

2.3 The diagonal case 

In this section we consider a particular case when for some basis of M the covariance matrices are 
in diagonal form and invertible. In this setting we prove that a random walk generated by d — 1 
measures and an arbitrary rule i is transient. 

Proposition 2.6. Let d > 4 and fii, . . . , fJ-d-i be mean probability measures in W^ with 2 + /3 
moments, for some /3 > 0. Let Mi, . . . , M^-i be their covariance matrices and suppose that MiMj = 
MjMi for all i,j. Then there exists a d x d matrix A such that 

tT{AMiA^) > 2Amax(^Afj^^) \f i<d-l. 

Therefore, a random walk X generated by the measures (/Uj)^~-^ and an arbitrary adapted rule £ is 
transient. 

Before giving the proof of Proposition 12.61 we prove the following: 

Claim 2.7. Let Mi,. . . ,M^ be d x d invertible diagonal matrices with positive entries on the 
diagonal. For A ^ we define 

^{A) = max '' /^ '' . (2.9) 

i<j<ktr{AMjA^) ^ ' 

Then the minimum o/\^(A) exists among all diagonal matrices A and the minimizing matrix A is 
invertible. 

Proof. Since Mj is an invertible positive definite matrix, we can write Mj = BjBJ = B^, where 
Bj = BJ is an invertible matrix. 

Since scaling A does not change the ratio in (|2.9p . we may assume that ||^|| = 1 and restrict 
attention to such matrices. It is easy to see that the set S = {A diagonal : ||^|| = 1} is compact 
and the function fj{A) = \\AMjA \\ is continuous on S. 

Let gj{A) = tiiAMjA^) = tiiABjBjA^) = IWABjlW^, where |||C|||2 = Y!lj=iclj and we used 



Since Bj is invertible, we have ABj ^ for yl 7^ 0, so gj does not vanish on S. Thus as gj is 
continuous on 5, we conclude that 

A I— 7- max — i—r- 
^<j<kgj{A) 

is continuous on S and hence has a minimum. 

Let A be the minimizing matrix with diagonal elements Ai, . . . , A^ > 0. We will show that A is 
invertible. Suppose the contrary and assume without loss of generality that A^ = 0. 
We prove that if we replace A^^ = by a small e > 0, then we get a matrix Af, with ^{Af,) < ^{A). 
Let the diagonal elements of Mi be (ap^^i, which are all strictly positive. Then for the matrix Mi 
we will have for s such that ||M,|| = al 

An,ax(lMiI) Xsal 



tT{AMiA) Ej -^jOj 
If A^ has the same elements as A except for the (d, d) element which is replaced by e > such that 

A a^ 

e < -^ for all i = 1, . . . , d — 1 and all j = 1, . . . ,d — 1, then 

d 

tT{AeMiA,) = tr(lMil) + ea^, 

while Amaxl^eM.Ie) = Amax(lMil). 

Replacing each element of j4 by a sufficiently small number gives a matrix with smaller value 
of ^, which contradicts the choice of A. Hence this shows that A is invertible. D 

Proof of Proposition 12.61 Since MiMj = MjMi for all i,j, it follows (see for instance [3 Theo- 
rem 2.5.5]) that there is one orthogonal matrix that diagonalizes all the matrices Mj. So from now 
on we suppose that the Mj's are diagonal. 

Recall the definition of ^ from (j2.9p . Let A be the dx d invertible matrix that minimizes ^ among 
all diagonal matrices (recall Claim [277|) . 



Write Mi = AMiA'^ and 



J=\j<d-1: ^5^ = ^{A) 
I tr(M,) 



Since A and Mj are diagonal invertible matrices, it follows that Mj is also a diagonal invertible 
matrix. For each j < d — 1 we can find Vj G M such that \\vj\\ = 1 and MjVj = \\Mj\\vj. Note that 
since Mj is diagonal, it follows that Vj can be chosen to be one of the standard basis vectors of M"'. 
Let w G M*^ have \\w\\ = 1 and w _L {vi, . . . , Vd-i}- Then w will also be one of the standard basis 
vectors of M . 

Next, we separate two cases. 

Case 1: For some j £ J there is Uj _L Vj with \\uj\\ = 1 and MjUj = ||Mj||uj. In this case, 

tr(Mj) > {MjVj,Vj) + {MjUj,Uj) = 2||M,-||, (2.10) 

where the strict inequality follows from the fact that Mj is invertible. Hence in the case where 



Mj 



has multiplicity at least 2, we are done. 
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Case 2: For each j £ J the leading eigenvalue ||Afj|| of Mj has multiplicity one. We will show 
that this case leads to a contradiction; that is we can find another matrix with smaller value of ^ 
contradicting the choice of A as the minimizer. 

Let A^ be the dx d matrix such that Ai^w = (1 + e)w and Af^z = z for all z -L w. Note that A,, will 
also be diagonal, since w is one of the standard basis vectors of M'^. 

Let us denote by 7j the second largest eigenvalue of Mj. Then the assumption of case 2 implies 
that for each j £ J we have jj < \\Mj\\ and ||Mjy|| < 7j||y|| for all y _L Vj. 

Choose e > such that (1 + e)^||Mj|| < tr(Mi)^(A) for alH ^ J and (1 + ef-fj < \\Mj\\ for all 

J£J. 

Note that since A;, is diagonal, Aj = yl^ and AAg. is diagonal satisfying 

^iAj)= max "^^^^^" . 
l<^<d-l tr{AsM^Ae) 

By completing {w, Vj} to an orthonormal basis {bm}\rn=i °f ^ ^^ ^^^ that for alH < d — 1 

ti{A,MiAs) > tr(Mi), (2.11) 

since tr(M) = X^m=i(-^^m) bm) for any matrix M and any orthonormal basis. The strict inequality 
follows again from the fact that the matrix Ai^MiA^ is invertible. Also 

WAsMiAeW < \\Aef\\Mi\\ = (1 + e?\\Mi\\ 

II ^ '' ^11 — II ^11 II ''II \ / II ''II 

and for j £ J we have for all y _L Vj 

\\AeMjAey\\ < {l + e)\\Mj{Aey)\\ < {1 + ehj\\Aey\\ < {1 + ef-fj\\yl 

since A^y ± Vj. 

We conclude that ^(Ai^A) < ^{A) by considering separately in the max defining ^ the indices i ^ J 
and i £ J, and applying (j2.1ip . This contradicts the choice of ^ as a minimizer and establishes 
that case 2 is impossible. D 

3 More measures may yield a recurrent walk 

In this section we prove that the random walk described in Section [1.21 is recurrent. First we give 
the simpler example that was mentioned in the Introduction. 

Let S'^~^ he the d-dimensional unit sphere, i.e. S'^~^ = {x £ W^ : ||a;|| = 1}. Let Ci, . . . , C^ be caps 
that cover the surface of the sphere with the property that the angle between any two vectors from 
the origin to points on the same cap is strictly smaller than 7r/2. For every cap Cj, for i = 1, . . . ,k, 
we write m{Ci) for the vector joining to the center of the cap Cj. Then we choose Vi^i, . . . , v, ^-i 
to be d — 1 orthogonal vectors on the hyperplane orthogonal to m[Ci). 

For every x £ W^, we write C{x) for the first cap in the above ordering such that the vector joining 
and X intersects that cap. 

Theorem 3.1. Let X be a walk in M that moves as follows. When at x it moves along the direction 
ofm{C{x)) either +1 or —1 each with probability 1/2 and along each of the other d—1 directions, 
i.e. along the vectors fi(x),i) • • • )''^i(x),d-i ^^ moves independently as follows: ±1 with probabilities 
e/2 and stays in place with the remaining probability. Then X is a recurrent walk, i.e. there is a 
compact set that is visited by X infinitely many times a.s. 
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Remark 3.2. It can be shown that the ratio of the area of the unit sphere to the area of a cap 
as defined above with angle 7r/2 is equal to 2//1/2 (— 5— j 2)1 where I is the regularized incomplete 
beta function. It is then elementary to obtain that the last quantity can be bounded below by 
2^/2+1 y ^ gQ iiiQ^i i]2 the above theorem at least 2'^'^"'"^ measures are needed. 



Proof of Theorem 13. ll We define (p{x) = log||x||, for x E W^. Then by Taylor expansion to 
second order terms we obtain for some r/ G (0, 1) 



For each i and positive constants C,Ci, since Z is bounded, we have 



3! ^-^ dxidxjdxj. 

i,i,k=l ■^ 



dip 
dxi 



Xo 



d^ip Y^j^i ^i 



dx'i 



and max 



d^ip{x + r]Z) 



dxidxndxi 



< 



Ci 



\x + r/Z||' 



< 



C 



13' 



Let ui, . . . ,Ud be the vectors (basis of M ) as defined in the theorem. We now write both x and Z 
in this basis, i.e. we have that x = X]j=i XjMj and Z = ^1,1=1 ^iUi. Then for i 7^ j, by independence, 
we get that K[ZiZj] = 0, while E[-Z^il = 1 and for alH > 1 we have that ]E[.^fl = £■ Hence, putting 
all things together we obtain that 



ES^nz.z, 



{1 + eid - 3))\\x\\^ + 2ie - l)xl 



«j 



dxidxj 



(3.1) 



For the first coordinate xi of x, when decomposed in the basis described above, we have that 

xi = \\x\\ cos 9, 

where 9 is strictly smaller than 7r/4, so there exists 5 > so that cos^ > (1 + 5)\/2/2. Hence, we 
can now bound (]3.ip from above by 



XT 



2(e - 1) + 



2(1 + 5)2 



{l + e{d-3))], 



which can be made negative by choosing e small enough. Notice that in absolute value the last 
expression is at least c||a;||~ for a positive constant c, and hence since Z has mean 0, it follows 
that for llrrll large enough we have 

'd'^f{x + r]Z) 
dxidxjdxk 



1 y E 



i,j,k=l 



Zi Zj Zk 



1 
< - 

- 2 



E 



«j= 



d'^ip{x) 

' dxidxj 
1 ■' 



E[Z,Zj-] 



Therefore we deduce that for ||x|| > tq 

E[v9(x + Z) - ip{x)] < 0. 



(3.2) 



We now show that this implies recurrence. By the same argument used to show (|2.ip we get that 
a.s. 

limsup \\Xt\\ = 00. 

Let Trg = mf{t > : Xt £ B{0,ro)}. By (13. 2|) we obtain that (^(XtATrg) is a positive super- 
martingale. Hence the a.s. martingale convergence theorem gives that limj_>.oo fiXtAT,- ) = ^ 
exists a.s. and is finite. If T^q = 00 with positive probability, then since (p{x) — )• 00 as x — )■ 00, then 



lim(^(Xj 



tATr, 



00 with positive probability, which is a contradiction. Therefore, Tj.^ < 00 a.s. D 
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We will now give the proof of Theorem 11.51 

Proof of Theorem 11.51 By [61 Theorem 2.2.1] or analogously to the last part of the proof of 
Theorem 13. 11 to prove recurrence it is enough to find a nonnegative function / such that /(x) — t- go 
as X —7- oo, and 

E[/(X„+i) - f{Xn) \Xn = x]<0 for ah large enough x. (3.3) 

Before presenting the explicit construction of such a function, let us informally explain the intuition 
behind this construction. First of all, a straightforward computation shows that, if y is a simple 
random walk in Z , then 

E[||y„+i|| - ||y„|| I y„ = x] = ^^ + odl^^ll"'), 
E[(||y„+i||-||y„||)2|y„ = x] = i + o(||x||-i). 

One can observe that the ratio of the drift to the second moment behaves as ^r^; combined with 
the well-known fact that the SRW is recurrent for d = 2 and transient for d > 3, this suggests 
that, to obtain recurrence, the constant in this ratio should not be too large (in fact, at most 2)- 
Then, the second moment depends essentially on the dimension, and thus it is crucial to look at 
the drift. So, consider a (smooth in R \ {0}) function g{x) = 0(||x||); we shall try to figure out 
how the level sets of 5 should be so that the "drift outside" with respect to g "behaves well" (i.e., 
the drift multiplied by ||x|| is uniformly bounded above by a not-so-large constant). For that, let us 
look at Figure [TJ level sets of g are indicated by solid lines, vectors' sizes correspond to transition 
probabilities. Then, it is intuitively clear that the case of "moderate" drift corresponds to the 
following: 

• the "preferred" direction is radial, the curvature of level lines is large, or 

• the "preferred" direction is transversal and the curvature of level lines is small; 

also, it is clear that "very flat" level lines always generate small drift. However, one cannot hope 
to make the level lines very flat everywhere, as they should go around the origin. So, the idea is to 
find in which places one can afford "more curved" level lines. 

Observe that, for the random walk we are considering now, the preferred direction near the axes is 
the radial one, while in the "diagonal" regions it is in some intermediate position between transversal 
and radial. This indicates that the level sets of the Lyapunov function should look as depicted on 
Figure [21 more curved near the axes, and more flat off the axes. 

We are going to use the Lyapunov function 

f{x) = J-^)\\xr, 

\ \\x\\ J 

where a is a positive constant and Lp : S 1— t- M is a positive continuous function, symmetric in 
the sense that for any (no, . . . , u^-i) G S"'"^ we have 99(^0, ■ ■ ■ , Ud-\) = ip{Tou^i^o), . . . , Td-iu„{d-i)) 
for any permutation a and any r G { — 1, 1} • By the previous discussion, to have the level sets as 
on Figure [21 we are aiming at constructing ip with values close to 1 near the "diagonals" and less 
than 1 near the axes. 
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case 1: 

■^ 



case 2: 




case 3: 



case 4 




Figure 1: Looking at the level sets: how large is the drift? We have very small drift in case 1, very 
large drift in case 2, and moderate drifts in cases 3 and 4. 




Figure 2: How the level sets of / should look like? 
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By symmetry, it is enough to define the function y? for u S S such that uq > ?/i,...,(i-i ^ 
(clearly, it then holds that uq > 0), and, again by symmetry, it is enough to prove (|3.3p for all 
large enough x G Z of the same kind. For such u G S abbreviate Sj = uj/uq, j = l,...,d — 1; 
observe that, \i u = x/||3;||, then Sj = Xj/xq. We are going to look for the function (for u as 
above) '^{u) = 1 — atp{si, . . . , Sd-i), where ^ is a function with continuous third partial derivatives 
on [0, 1]"^"^ (in fact, it will become clear that the function ip extended by means of symmetry on 
[—1,1] has continuous third derivatives on [— 1,1] , this will imply that o-s in the computations 
below are uniform). 

Next, we proceed in the following way: we do calculations in order to figure out, which conditions 
the function ip should satisfy in order to guarantee that (|3.3p holds, and then try to construct a 
concrete example of ip that satisfies these conditions. 

First of all, a straightforward calculation shows that for any e G Z'^ with ||e|| = 1 we have 



\x + e\ 



X 



1 + a ,, ,,^ + 



a 



1 



\x\ 



2 X 



a(2 



a 



(x,e)^ 



1 



+ o{ X 



(3.4) 



as X — )• oo. 

In the computations below, we will use the abbreviations 

dtl;{si,...,Sd-i) 



V^: = 



t. 



dsj 

9^V'(si,...,gd_i) 

dsidsj 



hJ 



l,...,d-l, 

= l,...,d-l. 



Let us now consider x G Z . From now on we will refer to the situation when xq > xi^,,,^d-i ^ 
as the "non-boundary case" and xq = xi = ■ ■ ■ = Xm > a^m+i > • • • > Xd^i > for some m > 1 
as the "boundary case" . Observe for the boundary case the corresponding s will be of the form 
s = (1, . . . , (l)m> Sm+i, ■ ■ ■ , Sd-i); here and in the sequel we indicate the position of the symbol in a 
row by placing parentheses and putting a subscript. Also, in the situation when only one coordinate 
of the vector s changes, we use the notation of the form V'((^)i) for ^p{sl, . . . , Sj-i, s, Sj+i, . . . , Sd-i), 
possibly omitting the parentheses and the subscript when the position is clear. 

First we deal with the non-boundary case. 

Let us consider x G Z*^ such that xq > xi^,,,^d-i > 0. Again using (|3.4p and observing that (recall 



Xj/Xo) 



write 



xo— 1 



S,(l + Xr.^ + Xr,'^ + o( 



0) and 



XO + l 



Sj{l 



Xq^ + Xq'^ + o{ 



% 



we 



E[/(X„+i) - f{Xn) I Xn = X] 

= -{l-aiP{s))\\xr + 



7 



2(7 + d-l) 



a 



H^i^--- 



ff^J)ll^-eo| 



+ 



2(7 + d-l)^ 



2(7 + d-l) 



+1' 



'^jjll^ + ^ol 



WE[(i-"^(^))ii---.r + (i-"V'(^))i 



\X + Cj 



1 - a^(.) - ag (^ + Ik- - f g ^^^;- + 



0[ X 



15 



X ( 1 - Qt^— 7777 + 



1 



j;||2 2||x|P 2 

d-l 



-a(2-a) 



I 119 II 119 



+ o[\\x\ 












ox 



X 1 + a7r-FT7T + 



1 , 

-a(2-a) 



119 oil 119 o V / II 119 II 119 

xr 2 X r 2 X r xr 



+ X 



2(l-a^(s)) 



+ 






2(7 + d-l)^ 



a 



2(1 - qV'(s)) + 1 - aip{s) + aXfTV' - -^V^' 

J n^^ J.. 



X 1 — a- 



+ 



a 



1 



1 



IxiP ' 2||x|P 2 '^''ll-r.112 ' IUII2 



xr X 



+o(iNr') 



+ ( 1 — aV'(s) — qXq ^' 



^j 2x2 '''■''■'' 






/, X,- a 

X l + a--4Tr + 



-a{2-a)- 



x~, 



1 



ax 



7 + d- 1 



1 - a^(s) (2 — a)(l — a-(/'(s)) Xg 



x||^ IfI, 

2 '^-l 



+o(iNr') 



2||x|| 



2||x|| 



X 



Z_^\^2 \\^\\2)'^3 



i=i 



J^r\ X 



1 X ^ SjSj 



2 .^. x2 "^'^ + 



«J=1 



7 + d-l 



(d-l)(l-a^(s)) (2-a)(l-a^(s)) xg (2 - a)(l - a'(/'(s)) 



2||x||2 



+ 



2||x|P 



2||x||2 



d-l 



1^ 1 



■ 1 K' •^ ■ 1 •''0 



ox 



-a 



x||" ^$(x,V') + a||x||" ^(•^ ^$i(x,^,7,q) +a$2(2;,V',7,a)): 



where ^i and <1>2 ai'e uniformly bounded for large enough x, and 



<^{x,'ip) 



1 llxl 



|2 2 



+ 



d-l d-l 



(3.5) 



(3.6) 



i=i 



*j=i 



The idea is then to prove that, with a suitable choice for tp, the quantity ^(x, V') will be uniformly 
positive for all large enough x, and then the second term in the right-hand side of ()3.5p can be 
controlled by choosing large 7 and small a. This will make (j3.5p negative for all large x. 

Now, in order to obtain a simplified form for p.6p . we pass to the (hyper) spherical coordinates: 



Si = r cosc/i, 

S2 = r sin^i cos 02) 



Sd-2 = r sin 6*1 . . . sin d^-i cos 6'rf_2, 
Sd-i = r sin 6*1 . . . sin 6'd_3 sin 6'd_2- 
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Since 



Figure 3: On the construction of h 



d± ^^A .1,11 - ^ ^ 



1 + r^, and (abbreviating i^'r = -^ and V'r 



d-1 



i=i 



i,j=^ 



we have 



11 2 



1 + r^ / 1 — r^ 



2 V(l+r2)2 



(3.7) 



Now, we define the function ip (it will depend on r only, not on ^i, . . . , 9d^2) in the following way. 
First, clearly, we need to define ip{r) for r G [0, \/d — 1]. Then, observe that 



/d=^ 



1-r' 



i^+r^r 



dr 



d 



>0, 



(3.8) 



so, for a suitable (small enough) Eq we can construct a smooth function h with the following 
properties (on the Cartesian plane with coordinates {r,y), think of going from the origin along 



4ei 



until it intersects with y 



l-r^ 



(l+r2) 



T^- 



and then modify a little bit the curve around the 



intersection point to make it smooth, see Figure [3]): 



(i) < h{r) < -^^^ for all r < 2eo and /i(r) 



l-r^ 



(1+r^) 



Y for r > 2eo; 



~ -i-^ as r — )• 0; 



(ii) h(0) = and h(r) , , 
(ii) JTT^ - ^(^) > ^ for r < Eq; 
(iv) 6 := /f/^^/i(r) dr>0 (by ([MD it holds in fact that b G (0, 1)); 
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(v) Jq h{u) du > 3(^3^ for all r G (0, Vd-l]. 
Denote H{r) = JI h{u) du, so that we have H{\/d — 1) = b. Then, define for r G [0, \/d — 1] 



■0(r) 



/d=T" 



Hiv) 



bv 



'P 3(d - 1)3/2 



dv. 



(3.9) 



For the function -0 defined in this way, we have r tp^r) = , _^ ,3^2 — H{r), so h{r) + {r ■ilj'{r)) 
b{d — l)~3/2j.2_ gy construction, it then holds that 



inf 

re[0,Vd- 



^/(1^ + (-V(0)')>K^-i)-^/^^^a1, 



(3.10) 



and this (recall (j3.6p and (|3.7p ) shows that, if 7 is large enough and a is small enough then the 
right-hand side of ()3.5p is negative for all large enough x G Z . 

To complete the proof of the theorem, it remains to deal with the boundary case. 
Let xq = xi = ■ • • = Xm > Xm+i > • • • > Xd-1 > for some m > 1. Using (13. 4|) (up to the 
term of order ||x||~'^ in the parentheses), using the fact that (p is invariant under permutations and 
observing that xq and ||a;|| are of the same order, we have 



E[fiXn+l) - f{Xn) \Xn = x\ 

7 + m 



-(l-a0(s))||x|r + 



+ 



2(7 + d-l 



(i-«^((^U)l 



x-eol 



f^))ll- + «o- 



+ 



2(7 + d - 



^ d-l 



' l2(7 + d-l) 



1 — ai 



jx + ej\ 



^(^))ii---.ir + (i-"^(¥)) 

(l - o^Uis) - ^ + odlxir^))) (l - a/^ + o(||x||-i) 
V V Xn / / V X r 



fe=l fc=m+l 



2(1-0^^(5)) 
1 



d~\ 



1 - aU{s) - ^ + o(||x||-^))) f 1 - a-^ + o{\\x\\-^) 



2(7 + d-l) ^ 

^ ' ^ j=m+l 

(1 - 0(^^(5) + ^ + o(||x||-i))) (1 + a-^ + o(||x||-i)) - 2(1 - aV(s)) 



ax 



, 7 + m 
2(7 + d-l) 



x| 



^ m— 1 d— 1 

-(^^^^+2^:,+ ^ sfc^^)+o(iix||-i) 

11 ; I 1 



A;=l 



fc=m+l 



(3.11) 
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(observe that in the above calculation all the terms of order ||3;||" ^ that correspond to the choice 
of coordinates m + 1, . . . ,d — 1 of x, cancel). 

Now simply note that by the property (v), we have ip'{r) < for all r G (0, \/d — 1]. Observe also 
that for some positive constant 6o it holds that 'ip'{r) < —6o for all r G [1, \/d — 1]. Then (recall 
that in the boundary case si = 1 and Sj > for all j = 2, . . . , d — 1) we have 

^'. = fi-0' < for all j = 1, . . . , d - 1 and ^Us) < pi=. 

■^ r \/d- 1 

This implies that the right-hand side of (IS.llh is negative for all large enough x G Z and thus 
concludes the proof of Theorem 11.51 D 



A conjecture 

We end this paper with an open question: 

Conjecture 3.3. Let /^i, . . . ,/id-i be d- dimensional measures in M , d > 4, with mean and 2 + /3 
moments, for some /3 > 0, and i an arbitrary adapted rule. Then the walk X generated by these 
measures and the rule i is transient. 

To answer this question, by Theorem 11.31 it suffices to prove the existence of a matrix A satisfying 
the trace condition (jl.ip . So far, we were able to prove it in the case when the d — 1 covariance 
matrices are jointly diagonalizable. 
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